The Semantic Structure of Roget’s Thesaurus Cross-References
Abstract
This study analyzed a database version of Roget’s Thesaurus (Roget’s International Thesaurus, 3rd Edition, 1962) for connectivity patterns among cross-references in order to identify the implicit conceptual structure. Semantic patterns implicit in the data, at both the local and global levels of the Thesaurus structure, are identified.
Similar resources
Roget's Thesaurus as a Lexical Resource for Natural Language Processing
WordNet proved that it is possible to construct a large-scale electronic lexical database on the principles of lexical semantics. It has been accepted and used extensively by computational linguists ever since it was released. Some of its applications include information retrieval, language generation, question answering, text categorization, text classification and word sense disambiguation. I...
Analysis and Construction of Noun Hypernym Hierarchies to Enhance Roget’s Thesaurus
Lexical resources are machine-readable dictionaries or lists of words, where semantic relationships between the terms are somehow expressed. These lexical resources have been used for many tasks such as word sense disambiguation and determining semantic similarity between terms. In recent years some research has been put into automatically building lexical resources from large corpora. In this ...
Disambiguating Hypernym Relations for Roget's Thesaurus
Roget’s Thesaurus is a lexical resource which groups terms by semantic relatedness. It is Roget’s shortcoming that the relations are ambiguous, in that it does not name them; it only shows that there is a relation between terms. Our work focuses on disambiguating hypernym relations within Roget’s Thesaurus. Several techniques of identifying hypernym relations are compared and contrasted in this...
A Comparison of WordNet and Roget's Taxonomy for Measuring Semantic Similarity
This paper presents the results of using Roget’s International Thesaurus as the taxonomy in a semantic similarity measurement task. Four similarity metrics were taken from the literature and applied to Roget’s. The experimental evaluation suggests that the traditional edge counting approach does surprisingly well (a correlation of r=0.88 with a benchmark set of human similarity judgements, with...
The Design and Implementation of an Electronic Lexical Knowledge Base
Thesauri have always been a useful resource for natural language processing. WordNet, a kind of thesaurus, has proven invaluable in computational linguistics. We present the various applications of Roget’s Thesaurus in this field and discuss the advantages of its structure. We evaluate the merits of the 1987 edition of Penguin’s Roget’s Thesaurus of English Words and Phrases as an NLP resource:...